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1. Introduction 

Entering a new social group, we are vividly interested in all kinds of non- 
formal contacts. They are necessary to interpret and qualify properly all 
information we get: as relevant or marginal, unique or commonly available, 
urgent or not so, etc. We are taught by evolutionary psychology that 
this need reflects the way of work of the human brain, as it has been formed 
during millions of years of evolution. This need forms then our today's 
relations with people as well. As a consequence, it remains relevant for 
any social theory of human relations. That is why gossip appeared as an 
appealing catchword [2J @J 03 HE E] . Because of its roots noted above, 
theory of gossip can be seen as a part of evolutionary psychology. Once an 
evolutionary sociology emerges jHJ, we will certainly find it there. 

In sociophysics, we look at social sciences through a mathematical glass. 
Being somewhat blind to hermeneutical analyzes, we look for determinism, 
structure and numbers. Such an attitude meets an old hope of sociologists 
to deal with problems as well-defined and narrow as physicists have. (Invo- 
cations to physics are quite frequent in old sociological textbooks [TU1 ITT] , 
to call only few examples.) Sociology can meet with sociophysics in all cases 
where the structure of society is of importance. By structure we mean a 
system of mutual or directed connections between people. In a reduction- 
istic approach, such a system can be represented by a graph, where people 
are nodes and relations between people are links. Quite naturally, such a 
picture is a favorite tool in sociophysics. 

Here we are going to use this mathematical representation to analyze the 
spreading of gossip. The starting point is the theory and experiment pro- 
posed and performed recently by some of present authors |12j . According to 
this theory, "gossip is non-public information about knowable people and its 
primary attribute is proliferation. Gossiping is a communicative propensity 
characteristic of the human race manifesting itself in smaller communities" . 
Then, the person who is the subject of the gossip is known personally to 
the community. This fact makes the gossip interesting and this interest is 
the necessary condition of the gossip spreading. This "semiprivate" charac- 
ter makes our case different from the theory of rumor by Galam ^3]. The 
experiment ^1] dealt with an interest in gossip about a known or knowable 
person in a web-based social network. As a result, three levels of social 
inclusion have been found, which practically limited the gossip spreading. 
As a sample of the questionnaire, answers were gathered to the following: 

1. Would you tell about your girlfriend's new job to your friend? 

2. Would you tell about your girlfriend's new job to your friend's girl- 
friend? 
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3. Would you tell about your girlfriend's new job to your friend's girl- 
friend's colleague? 

The percentage qi of positive answers varied from q% = 100% through qi = 
74.8% till qs = 22.1%, respectively for questions 1, 2 and 3. After the third 
degree the results had shown a sharp decline |14j . 

These considerations led us to our main question, under what conditions 
a given gossip will be known in the whole community? The above numbers 
qi (i = 1, 2, 3) served us as probabilities, that the gossip will be told to people 
of 1-st, 2-nd and 3-rd level of inclusion, defined by the questionnaire. At this 
point we are faced with the as-yet-unsolved problem, what is the structure of 
the social network? We have to admit that the answer varies from one kind 
of network to another, one or another kind of social ties. In the literature 
of the subject, one can find arguments about different parameters of social 
networks: size from a hundred to three hundreds and more [21 I15[ I16j . 
clustering coefficient [T7] . strength of ties JH] and structure ^1120]. The 
results can depend also on whether complete networks or personal networks 
are investigated [U- When we speak on friendly personal networks, the size 
of a typical group can decrease by at least one order of magnitude 22 . Not 
entering these discussions, here we attack the problem of gossip spreading 
in a model way, where the average number of friends is a model parameter. 
Also, for simplicity we choose the random graph of Erdos and Renyi |24j as 
a model of a social network. This selection should serve as a useful point of 
reference. 

The goal of this paper is to calculate the probability that the gossip is 
known, averaged over the community members. Basically, the result is close 
to zero or one, except some range of the average number of friendship ties. 
This range can be seen as the range of a transition between two phases: 
"they do not know" and "they know". However, even if the width of this 
range eventually shrinks to zero in the limit of large networks, this limit is 
not relevant for social sciences, where the quality of useful approximations 
does not necessarily increase with the system size. 

In next section we describe the model calculations and the results. Last 
section is devoted to their discussion. 

2. Calculations and results 

From noted above, the following model assumptions emerge: 

1. The set of nodes are those who "know about", and that is why they 
are willing to hear. 

2. The links join two nodes if they are friends. 
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3. The linkage is random, as in the Erdos-Renyi model. 

4. The question is to evaluate the size of the group who will know the 
information. 

The detail is if the victim of the gossip is also a member of the network 
in which gossip is spread. The argument for this assumption is in the 
questionnaire "about your girlfriend's new job". In this case the talker role 
is to be limited to the set of boyfriends of the girl. However, we assume 
that the girl can have more boyfriends, and then the number of talkers can 
be greater than one. 

In this case we have two parameters: ./V (the number of nodes) and p 
(the probability of a link of friendship between randomly selected nodes). 
As we know from the theory of random networks [27], the mean degree is 
z = p(N — 1). The numbers q%, q 2 and q% can be interpreted as weights 
in the average level of being informed about a given gossip. All friends of 
the girl who got a job will know it with probability one {q\ = 1). This is a 
contribution z. Their friends (each has z — 1 still not informed) will know 
the gossip with probability q 2 . This is a contribution q 2 z{z — 1). Finally, 
consider friends of the friends (supposed they are not informed yet). If each 
friend has z — 1 uninformed friends, the information will pass to them from 
the teller with, say, probability q 3 (z — l)z(z — 1). Then, total level X of 
being informed on the gossip would be 



This is a function of N and p = z/{N — 1). 

This expression has some deficiency: in the random networks the prob- 
ability that two "friends" of a node are also "friends" is z/(N — 1) = p. 
In the above calculation, we disregarded this possibility. Now we are go- 
ing to include it. In the first zone, z friends are informed with probability 
q\. Each has (z — 1) neighbors, p of them are already informed. Then, 
newly informed are only (1 — p) next neighbors, and their contribution will 
be q 2 z{z — 1)(1 — p). How many still non-informed neighbors have these 
z(z — 1)(1 — p) people? The answer is that each has (z — 1)(1 — p). They 
will be informed by a teller with probability q%. Then, their contribution is 
q 3 z(z — 1) 2 (1 — p) 2 ■ The total formula is: 



X = 



qiz + q 2 z(z - 1) + g 3 z(z - l) 2 
N - 1 



(1) 



X = 



qiz + q 2 z(z -l)(l-p)+ q 3 z(z - 1) 2 (1 - pf 
N - 1 



(2) 



We note that still there are some assumptions left about the lack of correla- 
tions of further order, the arguments are somewhat heuristic, and valid only 
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for small p. However, X obviously increases with p. In the range where the 
formula is not valid (large p) we rely on a computer simulation. 

For given victim of the gossip i (one of N nodes constituting network) 
and all t(i) of its nearest neighbors (talkers) we evaluate the number ni(i) 
[77-2(2)] of paths of length 1 [2] from all other nodes j to talkers. The prob- 
ability that j-th node is not informed is 

l-P j = (l-q 3 ) ni ®(l-q S ) r »®. (3) 
Then, the level of being informed on the gossip for given victim i is 

X, = ' W + ^, (4, 
N-l ' V ; 

where summation goes over all j 7^ i and j is not a talker. The total level 
X of being informed of the gossip is averaged over all possible victims of 
the gossip in the given network 

N 

X = Y J X l /N. (5) 

i=i 

We carry out our simulation for the set of probabilities qi = 1, q% = 0.748 
and q 3 = 0.221. 

It appears (Fig. ^) that at some value of p, almost everybody will know 
the gossip. This value of p is however not strictly defined and it depends 
on the system size N. For small p, both expressions (Eqs. Q and ©), for 
correlated and uncorrelated (i.e. random) case, work almost equally well. 

Motivated by tradition of statistical mechanics, we made an attempt to 
evaluate the probability p c , where X = 1/2. This p c can be seen as a critical 
value between the two phases remarked above, where "they do not know" 
for p < p c and "they know" for p > p c . The size dependence of p c , i.e. 
p c (N), is shown in Fig. |2J The results nicely fit a power law p c oc iV~ 7 . 
The exponent 7 slightly varies with the measured probabilities (q\, q<i and 
qs); it is 0.68 for the values of the probabilities (1.0, 0.748 and 0.221) used 
here, but 0.63 for (1.0, 0.7 and 0.25), 0.65 for (1.0, 0.75 and 0.25) and 0.66 
for (1.0, 0.8 and 0.2). 

We made also an attempt to evaluate the influence of the clustering 
coefficient C on our results. The coefficient C is defined as the ratio of 
number of links between ki nearest neighbors of i-ih site, divided by the 
maximal value of this number (ki(ki — l)/2) and averaged over all sites 
of the network with more than one neighbor. Our motivation comes from 
the suggestion j^H] that in social systems, the correlation is larger than for 
random case. The simulation is performed for N = 1000 and p = 0.0135, 
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Fig. 1. Average probability X of being informed on a gossip against (a) the prob- 
ability p and (b) mean node degree z. Theoretical curves (Eqs. (JTJ and differ 
only slightly for N = 100, but coincide for larger networks. Their accordance 
with the simulation results improves for larger N, where the correlations between 
informed neighbors can be neglected with better accuracy. 



which is equal to p c (N = 1000) for the random (i.e. uncorrelated) network. 
The clustering coefficient is increased by a rewiring procedure: a node is 
selected with at least K cut = 3 neighbors, and the link to one of its neighbor 
is cut; instead, it is added between two remaining neighbors. 

The result is that as C increases, the average size X of informed group 
decreases. Example of this result is shown in Fig. |21 It is clear that in 
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Fig. 2. Dependence of critical probability p c on the system size N. The solid line 
shows the least square fit p c oc 7V~ 7 with 7 ks 0.68. 

the case of larger C, information is transmitted more frequently within a 
small group. On the contrary, its spread over the whole community is less 
effective. This effect is parallel to the discussion in sociological literature, 
where links joining different compact groups (the so-called weak ties) are 
considered to be crucial for the information spreading |18j . 



When a social group is formed from the beginning, almost nobody knows 
anything about others. Soon mutual ties are built and strengthen, and 
information starts to flow. In our picture, this process can be interpreted 
as an increasing of the probability p in time. The results presented in Fig. 
1 indicate, that the information carried by gossips increases initially with p 
as a low degree polynomial. Gradually, the whole group becomes informed. 

Keeping the experimental values of qi constant, as we do, we can expect 
some characteristic distance b from the victim to a member who is informed 
with probability, say, 1/2. (This distance is a graph characteristics and it 
should not me mixed with the social distance, discussed elsewhere |2Ull3U] h 
Certainly, this distance depends on the numbers qi, i = 1,2,3. On the 
other hand, the diameter of the random network can be evaluated [23] as 
d = IniV/lnz. At p = p c we can expect that d = b. Approximating z by 
Np, we get b = ln(A r )/ ln(A r p c ), i.e. 



3. Discussion 



Pc = N 



1/6-1 



(6) 
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Fig. 3. Influence of the clustering coefficient C on the level of gossip propagation X 
for N = 1000. Increasing C and keeping p constant, we eventually get the network 
split in parts. 



Comparing this with our numerical result p c = N~~ / = N~ , we get 
6 close to 3.0. Having in mind our values of the probabilities qi, we are 
not surprized with this distance. Reasonably enough, it agrees with the 
interpretation of the experiment, given in our previous work entitled Three 
levels of inclusion |14| . We conclude that the exponent 7 is not universal, 
but it depends on the probabilities qi. With increasing p, the whole group 
is gradually dragged into the shell of radius b around the victim. Actually, 
the gossiping can be a good reason to enhance group ties. 

In statistical mechanics, our results may be relevant for the percolation 
problem in random networks. It is known that large connected clusters ap- 
pear for p > 1/N j2UH7j. Important difference is that in our case of gossip, 
we have one source of information. In this aspect, the gossip spread can be 
compared to a spread of infection, e.g. in networks of sexual interactions 
|31| . Once we allow for a distribution of sources within the network, the 
problem of gossip becomes alike to the family of problems, as bootstrap 
percolation [32] or diffusion percolation [33] EH] • It seems natural that these 
problems will find social applications, similar to those [35] of standard perco- 
lation theory [36] . The bottleneck here is the sociological experiment, which 
is much more difficult, costful and debatable than computer simulations. 
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